# Academic Document Processing
Im2latex Base
A VisionEncoderDecoder model for generating LaTeX formulas from images, utilizing Swin Transformer encoder and GPT-2 decoder architecture
Image-to-Text
Transformers

I
Matthijs0
56
1
Im2latex
MIT
A baseline model based on VisionEncoderDecoderModel, fine-tuned on datasets for generating LaTeX formulas from images.
Image-to-Text
Transformers

I
DGurgurov
288
11
Zhen Latex OCR
Apache-2.0
An OCR model specialized in recognizing Chinese-English mixed LaTeX formulas, supporting local offline CPU inference
Image-to-Text
Transformers

Z
MixTex
885
31
Pix2text Mfd
MIT
Pix2Text's Mathematical Formula Detection (MFD) model for recognizing mathematical formulas in images
Text Recognition Other
P
breezedeus
1,369
3
Cephalo LaTeX Phi 3 Vision 128k 4b Beta
Apache-2.0
Cephalo is a series of vision-language large models focused on multimodal materials science. The current version specializes in converting mathematical formula images into LaTeX code.
Image-to-Text
Transformers

C
lamm-mit
16
0
Nougat For Formula
Apache-2.0
A fine-tuned mathematical formula recognition model based on Nougat-small, excelling in extracting LaTeX formula code from images
Image-to-Text
Transformers

N
CuiSiwei
40
5
Nougat Small
Nougat is a vision-language model based on the Donut architecture, specifically designed for converting scientific PDFs into Markdown format.
Image-to-Text
Transformers

N
facebook
10.28k
27
Featured Recommended AI Models